Comments on: Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination

Author

  • Stefan Van Aelst
Abstract

Agostinelli, Leung, Yohai, and Zamar (Agostinelli et al. in the remainder) consider the difficult problem of robust estimation based on high-dimensional data. If outlying values can appear independently in the variables, then the majority of the observations in high-dimensional data can easily be contaminated, as pointed out in Alqallaf et al. (2009). Consequently, standard robust methods fail in this case, and new methods need to be developed that can handle this type of contamination. Moreover, in addition to independent (cellwise) contamination, casewise or structural outliers can still appear in the data. This situation was formalized as the partially spoiled independent contamination model in Alqallaf et al. (2009). In their paper, Agostinelli et al. are the first to introduce a consistent estimator of multivariate location and scatter that is highly robust against both cellwise and casewise outliers. Their two-step generalized S-estimator (2SGS) is a strongly consistent estimator of the location and shape of general elliptical distributions. Like other proposals, the estimator proceeds in two steps. In the first step, an outlier detection rule is used to identify potential cellwise outliers. A first improvement is the use of a data-adaptive cutoff instead of a fixed cutoff value when filtering cellwise outliers. The second novelty is to replace flagged outliers by missing values, as first proposed in Danilov (2010) and Farcomeni (2014), while earlier proposals tried to reduce their effect through some form of Winsorization, see e.g., Alqallaf et al. (2002), Van Aelst et al. (2011, 2012),
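The first step described in the abstract (flag suspicious cells, then replace them by missing values instead of Winsorizing them) can be illustrated with a minimal sketch. It is not the filter of Agostinelli et al.: it uses a fixed cutoff on median/MAD-standardized values rather than their data-adaptive cutoff, and the function name `filter_cells`, the cutoff of 3.0, and the toy data are assumptions made for illustration only.

```python
import math
from statistics import median


def filter_cells(rows, cutoff=3.0):
    """Return a copy of `rows` with cellwise outliers replaced by NaN.

    Each column is standardized by its median and scaled MAD; cells whose
    absolute robust z-score exceeds the fixed `cutoff` are marked missing.
    (Assumption: fixed cutoff, unlike the adaptive rule in the paper.)
    """
    filtered = [list(row) for row in rows]  # leave the input untouched
    for j in range(len(rows[0])):
        column = [row[j] for row in rows]
        center = median(column)
        # 1.4826 makes the MAD consistent for the normal distribution.
        scale = 1.4826 * median([abs(x - center) for x in column])
        if scale == 0:
            continue  # degenerate column: nothing can be flagged
        for i, x in enumerate(column):
            if abs(x - center) / scale > cutoff:
                filtered[i][j] = math.nan
    return filtered


# Toy data: one outlying cell in each column, in different rows.
data = [
    [1.0, 10.0],
    [1.2, 9.8],
    [0.9, 10.1],
    [1.1, 50.0],
    [1.0, 10.2],
    [8.0, 9.9],
]
cleaned = filter_cells(data)
n_missing = sum(math.isnan(x) for row in cleaned for x in row)
```

Only the two flagged cells become missing; the remaining entries of the affected rows are kept, which is the point of cellwise filtering as opposed to discarding or downweighting whole observations. A second step would then need an estimator that can handle incomplete data.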

Similar resources

A Two-Phase Robust Estimation of Process Dispersion Using M-estimator

Parameter estimation is the first step in constructing any control chart. Most estimators of mean and dispersion are sensitive to the presence of outliers. The data may be contaminated by outliers either locally or globally. The existing robust estimators deal only with global contamination. In this paper a robust estimator for dispersion is proposed to reduce the effect of local contamination ...

Stahel-Donoho estimation for high-dimensional data

We discuss two recently proposed adaptations of the well-known Stahel-Donoho estimator of multivariate location and scatter for high-dimensional data. The first adaptation adjusts the calculation of the outlyingness of the observations, while the second adaptation allows separate weights to be given to each of the components of an observation. Both adaptations address the possibility that in higher d...

Estimation of AR Parameters in the Presence of Additive Contamination in the Infinite Variance Case

If we try to estimate the parameters of the AR process {Xn} using the observed process {Xn+Zn}, then these estimates will be badly biased and inconsistent, but we can minimize the damage using a robust estimation procedure such as GM-estimation. The question is whether additive contamination affects estimates of “core” parameters in the infinite variance case to the same extent that it does in the ...

Simultaneous robust estimation of multi-response surfaces in the presence of outliers

A robust approach should be considered when estimating regression coefficients in multi-response problems. Many models are derived from the least squares method. Because the presence of outlier data is unavoidable in most real cases and because the least squares method is sensitive to these types of points, robust regression approaches appear to be a more reliable and suitable method for addres...

Testing the Exactitude of Estimation Methods in the Presence of Outliers: An accounting for Robust Kriging

Estimation of gold reserves and resources has long been of interest to mining engineers and geologists. Outlier values indicate the economic part of the deposit, provided that they do not arise from human or technical errors. The presence of these high values causes a spurious, dramatic increase in the variance estimates of economic blocks when applying conventional m...

Publication date: 2015